Boundary Correction of Protein Names Adapting Heuristic Rules

Authors

  • Tomohiro Mitsumori
  • Sevrani Fation
  • Masaki Murata
  • Kouichi Doi
  • Hirohumi Doi
Abstract

In this study, we developed heuristic rules concerning the boundaries of protein names for the automated extraction of protein names from biomedical literature. The extraction was carried out with a Support Vector Machine (SVM). From an analysis of the results, we identified which words in a set of modifier words are included as part of protein names and which are not. Whether a modifier word is included in a protein name is critical. Applying these heuristic rules to the corpus improved the F-score by about 1.3 percentage points (from 76.10% to 77.41%) compared with the case without the proposed rules.
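
As a rough illustration of what a boundary post-processing step of this kind could look like, the sketch below applies one hypothetical rule to BIO-tagged tokens: if the token immediately before a predicted protein name belongs to a set of modifier words treated as part of the name, the name is extended one token to the left. The word set, the direction of the rule, and the function name extend_left are assumptions made for illustration only; the abstract does not list the actual rules.

```python
from typing import List, Set

# Hypothetical modifier words assumed to belong to the protein name.
# These are illustrative and are not taken from the paper.
INCLUDE_MODIFIERS: Set[str] = {"human", "mutant"}


def extend_left(tokens: List[str], tags: List[str],
                include: Set[str] = INCLUDE_MODIFIERS) -> List[str]:
    """Return corrected BIO tags, pulling one included modifier into the name."""
    fixed = list(tags)
    for i in range(1, len(tokens)):
        # A name starts at i and the previous token is an untagged modifier.
        if (fixed[i] == "B" and fixed[i - 1] == "O"
                and tokens[i - 1].lower() in include):
            fixed[i - 1] = "B"  # the modifier becomes the new start of the name
            fixed[i] = "I"      # the old start is now inside the name
    return fixed


tokens = ["the", "human", "p53", "protein", "binds"]
tags = ["O", "O", "B", "I", "O"]  # e.g. tags predicted by a classifier
corrected = extend_left(tokens, tags)
```

A mirror-image rule that trims a modifier from the start of a name could be written the same way, with a second hypothetical set of excluded words.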

Similar resources

Phrase-Boundary Translation Model Using Shallow Syntactic Labels

Phrase-boundary model for statistical machine translation labels the rules with classes of boundary words on the target side phrases of training corpus. In this paper, we extend the phrase-boundary model using shallow syntactic labels including POS tags and chunk labels. With the priority of chunk labels, the proposed model names non-terminals with shallow syntactic labels on the boundaries of ...

Heuristic Process Model Simplification in Frequency Response Domain

Frequency response diagrams of a system include detailed and recognizable information about the structural and parameter effects of the transfer function model of the system. This information can be obtained qualitatively and quantitatively by considering amplitude ratio and phase information together. In this paper, some rules and relationships are presented for making use of frequenc...

Thai Named Entity Extraction by incorporating Maximum Entropy Model with Simple Heuristic Information

Named entity (NE) extraction plays a very important role in many NLP tasks, such as information extraction. In Thai, NE extraction is much more difficult because of characteristics of the language: it lacks orthographical information to signal NEs and has no boundary indicator between words. In this paper, we present a Thai NE extraction system using Maximum Entro...

A Framework for Adapting Population-Based and Heuristic Algorithms for Dynamic Optimization Problems

In this paper, a general framework was presented to boost heuristic optimization algorithms based on swarm intelligence from static to dynamic environments. Regarding the problems of dynamic optimization as opposed to static environments, evaluation function or constraints change in the time and hence place of optimization. The subject matter of the framework is based on the variability of the ...

Fuzzy Rules in Case-based Reasoning

Similarity-based fuzzy rules are proposed as a basic tool for modelling and formalizing parts of the case-based reasoning methodology within the framework of approximate reasoning. The use of different types of rules for encoding the heuristic reasoning principle underlying case-based problem solving is discussed, which leads to different approaches to case-based inference. A model which combines...

Publication date: 2004